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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Chen, J. Don 
Li , Hui 

(ii) TITLE OF INVENTION: Transcriptional Coactivator for Nuclear 
10 Hormone Receptors 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: Lahive and Cockfield 

(B) STREET: 28 State Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 
20 (F) ZIP: 02109 

D (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
p (B) COMPUTER: IBM PC compatible 

^25 (C) OPERATING SYSTEM: PC-DOS/MS -DOS 

C± (D) SOFTWARE: Patent In Release #1.0, Version #1.2 5 

3 (vi) CURRENT APPLICATION DATA: 

^ (A) APPLICATION NUMBER: 

7"30 (B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 
(A) NAME: Liepmann, W. Hugo 
W35 (b) REGISTRATION NUMBER: 20,407 

ffl (C) REFERENCE/DOCKET NUMBER: UMM-02 6 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 617-227-7400 
40 (B) TELEFAX: 617-742-4214 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 
55 (A) NAME/KEY: CDS 

(B) LOCATION: 86.. 4338 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GCTGGATGGT GGACTCAGAG ACCAATAAAA ATAAACTGCT TGAACATCCT TTGACTGGTT 60 

5 

AGCCAGTTGC TGATGTATAT TCAAG ATG AGT GGA TTA GGA GAA AAC TTG GAT 112 

Met Ser Gly Leu Gly Glu Asn Leu Asp 
1 5 

10 CCA CTG GCC AGT GAT TCA CGA AAA CGC AAA TTG CCA TGT GAT ACT CCA 160 
Pro Leu Ala Ser Asp Ser Arg Lys Arg Lys Leu Pro Cys Asp Thr Pro 
10 15 20 25 

GGA CAA GGT CTT ACC TGC AGT GGT GAA AAA CGG AGA CGG GAG CAG GAA 2 08 

15 Gly Gin Gly Leu Thr Cys Ser Gly Glu Lys Arg Arg Arg Glu Gin Glu 

30 35 40 

AGT AAA TAT ATT GAA GAA TTG GCT GAG CTG ATA TCT GCC AAT CTT AGT 2 56 

Ser Lys Tyr lie Glu Glu Leu Ala Glu Leu lie Ser Ala Asn Leu Ser 
20 45 50 55 

O GAT ATT GAC AAT TTC AAT GTC AAA CCA GAT AAA TGT GCG ATT TTA AAG 3 04 

%0 Asp lie Asp Asn Phe Asn Val Lys Pro Asp Lys Cys Ala lie Leu Lys 

Q 60 65 70 
^25 

M-- GAA ACA GTA AGA CAG ATA CGT CAA ATA AAA GAG CAA GGA AAA ACT ATT 3 52 

^ Glu Thr Val Arg Gin lie Arg Gin lie Lys Glu Gin Gly Lys Thr lie 

75 80 85 

30 TCC AAT GAT GAT GAT GTT CAA AAA GCC GAT GTA TCT TCT ACA GGG CAG 400 
=~ Ser Asn Asp Asp Asp Val Gin Lys Ala Asp Val Ser Ser Thr Gly Gin 

r] 90 95 100 105 

[7: GGA GTT ATT GAT AAA GAC TCC TTA GGA CCG CTT TTA CTT CAG GCA TTG 44 8 

^35 Gly Val lie Asp Lys Asp Ser Leu Gly Pro Leu Leu Leu Gin Ala Leu 
^ 110 115 120 

GAT GGT TTC CTA TTT GTG GTG AAT CGA GAG GCA AAC ATT GTA TTT GTA 4 96 

Asp Gly Phe Leu Phe Val Val Asn Arg Glu Ala Asn lie Val Phe Val 
40 125 130 135 

TCA GAA AAT GTC ACA CAA TAC CTG CAA TAT AAG CAA GAG GAC CTG GTT 544 

Ser Glu Asn Val Thr Gin Tyr Leu Gin Tyr Lys Gin Glu Asp Leu Val 

140 145 150 

45 

AAC ACA AGT GTT TAC AAT ATC TTA CAT GAA GAA GAC AGA AAG GAT TTT 5 92 

Asn Thr Ser Val Tyr Asn lie Leu His Glu Glu Asp Arg Lys Asp Phe 
155 160 165 

50 CTT AAG AAT TTA CCA AAA TCT ACA GTT AAT GGA GTT TCC TGG ACA AAT 64 0 

Leu Lys Asn Leu Pro Lys Ser Thr Val Asn Gly Val Ser Trp Thr Asn 
170 175 180 185 

GAG ACC CAA AGA CAA AAA AGC CAT ACA TTT AAT TGC CGT ATG TTG ATG 688 
55 Glu Thr Gin Arg Gin Lys Ser His Thr Phe Asn Cys Arg Met Leu Met 

190 195 200 
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AAA ACA CCA CAT GAT ATT CTG GAA GAC ATA AAC GCC AGT CCT GAA ATG 73 6 

Lys Thr Pro His Asp lie Leu Glu Asp lie Asn Ala Ser Pro Glu Met 
205 210 215 

5 CGC CAG AGA TAT GAA ACA ATG CAG TGC TTT GCC CTG TCT CAG CCA CGA 7 84 

Arg Gin Arg Tyr Glu Thr Met Gin Cys Phe Ala Leu Ser Gin Pro Arg 
220 225 230 

GCT ATG ATG GAG GAA GGG GAA GAT TTG CAA TCT TGT ATG ATC TGT GTG 83 2 

10 Ala Met Met Glu Glu Gly Glu Asp Leu Gin Ser Cys Met lie Cys Val 
235 240 245 

GCA CGC CGC ATT ACT ACA GGA GAA AGA ACA TTT CCA TCA AAC CCT GAG 88 0 

Ala Arg Arg lie Thr Thr Gly Glu Arg Thr Phe Pro Ser Asn Pro Glu 
15 250 255 260 265 

AGC TTT ATT ACC AGA CAT GAT CTT TCA GGA AAG GTT GTC AAT ATA GAT 92 8 

Ser Phe lie Thr Arg His Asp Leu Ser Gly Lys Val Val Asn lie Asp 
270 275 280 

20 

ACA AAT TCA CTG AGA TCC TCC ATG AGG CCT GGC TTT GAA GAT ATA ATC 976 
Thr Asn Ser Leu Arg Ser Ser Met Arg Pro Gly Phe Glu Asp lie lie 
285 290 295 

25 CGA AGG TGT ATT CAG AGA TTT TTT AGT CTA AAT GAT GGG CAG TCA TGG 1024 
Arg Arg Cys lie Gin Arg Phe Phe Ser Leu Asn Asp Gly Gin Ser Trp 
300 305 310 

TCC CAG AAA CGT CAC TAT CAA GAA GCT TAT CTT AAT GGC CAT GCA GAA 10 72 

30 Ser Gin Lys Arg His Tyr Gin Glu Ala Tyr Leu Asn Gly His Ala Glu 
315 320 325 

ACC CCA GTA TAT CGA TTC TCG TTG GCT GAT GGA ACT ATA GTG ACT GCA 112 0 

Thr Pro Val Tyr Arg Phe Ser Leu Ala Asp Gly Thr lie Val Thr Ala 
35 330 335 340 345 

CAG ACA AAA AGC AAA CTC TTC CGA AAT CCT GTA ACA AAT GAT CGA CAT 116 8 

Gin Thr Lys Ser Lys Leu Phe Arg Asn Pro Val Thr Asn Asp Arg His 
350 355 360 

40 

GGC TTT GTC TCA ACC CAC TTC CTT CAG AGA GAA CAG AAT GGA TAT AGA 1216 
Gly Phe Val Ser Thr His Phe Leu Gin Arg Glu Gin Asn Gly Tyr Arg 
365 370 375 

45 CCA AAC CCA AAT CCT GTT GGA CAA GGG ATT AGA CCA CCT ATG GCT GGA 12 64 

Pro Asn Pro Asn Pro Val Gly Gin Gly lie Arg Pro Pro Met Ala Gly 
380 385 390 

TGC AAC AGT TCG GTA GGC GGC ATG AGT ATG TCG CCA AAC CAA GGC TTA 1312 
50 Cys Asn Ser Ser Val Gly Gly Met Ser Met Ser Pro Asn Gin Gly Leu 
395 400 405 

CAG ATG CCG AGC AGC AGG GCC TAT GGC TTG GCA GAC CCT AGC ACC ACA 13 6 0 

Gin Met Pro Ser Ser Arg Ala Tyr Gly Leu Ala Asp Pro Ser Thr Thr 
55 410 415 420 425 

GGG CAG ATG AGT GGA GCT AGG TAT GGG GGT TCC AGT AAC ATA GCT TCA 14 08 
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Gly Gin Met Ser Gly Ala Arg Tyr Gly Gly Ser Ser Asn lie Ala Ser 
430 435 440 

TTG ACC 'CCT GGG CCA GGC ATG CAA TCA CCA TCT TCC TAC GAG AAC AAC 14 56 

5 Leu Thr Pro Gly Pro Gly Met Gin Ser Pro Ser Ser Tyr Gin Asn Asn 
445 450 455 

AAC TAT GGG CTC AAC ATG AGT AGC CCC CCA CAT GGG AGT CCT GGT CTT 1504 
Asn Tyr Gly Leu Asn Met Ser Ser Pro Pro His Gly Ser Pro Gly Leu 
10 460 465 470 

GCC CCA AAC CAG CAG AAT ATC ATG ATT TCT CCT CGT AAT CGT GGG AGT 1552 
Ala Pro Asn Gin Gin Asn lie Met lie Ser Pro Arg Asn Arg Gly Ser 
475 480 485 

15 

CCA AAG ATA GCC TCA CAT CAG TTT TCT CCT GTT GCA GGT GTG CAC TCT 16 0 0 

Pro Lys lie Ala Ser His Gin Phe Ser Pro Val Ala Gly Val His Ser 
490 495 500 505 

20 CCC ATG GCA TCT TCT GGC AAT ACT GGG AAC CAC AGC TTT TCC AGC AGC 164 8 

Pro Met Ala Ser Ser Gly Asn Thr Gly Asn His Ser Phe Ser Ser Ser 
510 515 520 

TCT CTC AGT GCC CTG CAA GCC ATC AGT GAA GGT GTG GGG ACT TCC CTT 16 96 

25 Ser Leu Ser Ala Leu Gin Ala lie Ser Glu Gly Val Gly Thr Ser Leu 
525 530 535 

TTA TCT ACT CTG TCA TCA CCA GGC CCC AAA TTG GAT AAC TCT CCC AAT 1744 
Leu Ser Thr Leu Ser Ser Pro Gly Pro Lys Leu Asp Asn Ser Pro Asn 
30 540 545 550 

ATG AAT ATT ACC CAA CCA AGT AAA GTA AGC AAT CAG GAT TCC AAG AGT 17 92 

Met Asn lie Thr Gin Pro Ser Lys Val Ser Asn Gin Asp Ser Lys Ser 

555 560 565 

35 

CCT CTG GGC TTT TAT TGC GAC CAA AAT CCA GTG GAG AGT TCA ATG TGT 184 0 

Pro Leu Gly Phe Tyr Cys Asp Gin Asn Pro Val Glu Ser Ser Met Cys 

570 575 580 585 

40 CAG TCA AAT AGC AGA GAT CAC CTC AGT GAC AAA GAA AGT AAG GAG AGC 18 8 8 

Gin Ser Asn Ser Arg Asp His Leu Ser Asp Lys Glu Ser Lys Glu Ser 
590 595 600 

AGT GTT GAG GGG GCA GAG AAT CAA AGG GGT CCT TTG GAA AGC AAA GGT 193 6 

45 Ser Val Glu Gly Ala Glu Asn Gin Arg Gly Pro Leu Glu Ser Lys Gly 
605 610 615 

CAT AAA AAA TTA CTG CAG TTA CTT ACC TGT TCT TCT GAT GAC CGG GGT 1984 
His Lys Lys Leu Leu Gin Leu Leu Thr Cys Ser Ser Asp Asp Arg Gly 
50 620 625 630 

CAT TCC TCC TTG ACC AAC TCC CCC CTA GAT TCA AGT TGT AAA GAA TCT 2 032 

His Ser Ser Leu Thr Asn Ser Pro Leu Asp Ser Ser Cys Lys Glu Ser 
635 640 645 
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TCT GTT AGT GTC ACC AGC CCC TCT GGA GTC TCC TCC TCT ACA TCT GGA 2 080 

Ser Val Ser Val Thr Ser Pro Ser Gly Val Ser Ser Ser Thr Ser Gly 
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650 655 660 665 

GGA GTA TCC TCT ACA TCC AAT ATG CAT GGG TCA CTG TTA CAA GAG AAG 212 8 

Gly Val Ser Ser Thr Ser Asn Met His Gly Ser Leu Leu Gin Glu Lys 
670 675 680 

CAC CGG ATT TTG CAC AAG TTG CTG CAG AAT GGG AAT TCA CCA GCT GAG 2176 
His Arg lie Leu His Lys Leu Leu Gin Asn Gly Asn Ser Pro Ala Glu 
685 690 695 

GTA GCC AAG ATT ACT GCA CAA GCC ACT GGG AAA GAC ACC AGC AGT ATA 2 2 24 

Val Ala Lys lie Thr Ala Gin Ala Thr Gly Lys Asp Thr Ser Ser lie 
700 705 710 

15 ACT TCT TGT GGG GAC GGA AAT GTT GTC AAG CAG GAG CAG CTA AGT CCT 22 72 

Thr Ser Cys Gly Asp Gly Asn Val Val Lys Gin Glu Gin Leu Ser Pro 
715 720 725 

AAG AAG AAG GAG AAT AAT GCA CTT CTT AGA TAC CTG CTG GAC AGG GAT 2 320 

20 Lys Lys Lys Glu Asn Asn Ala Leu Leu Arg Tyr Leu Leu Asp Arg Asp 
730 735 740 745 

GAT CCT AGT GAT GCA CTC TCT AAA GAA CTA CAG CCC CAA GTG GAA GGA 2 368 

Asp Pro Ser Asp Ala Leu Ser Lys Glu Leu Gin Pro Gin Val Glu Gly 
25 750 755 760 

GTG GAC AAT AAA ATG AGT CAG TGC ACC AGC TCC ACC ATT CCT AGC TCA 2416 

Val Asp Asn Lys Met Ser Gin Cys Thr Ser Ser Thr lie Pro Ser Ser 

765 770 775 

30 

AGT CAA GAG AAA GAC CCT AAA ATT AAG ACA GAG ACA AGT GAA GAG GGA 24 64 

Ser Gin Glu Lys Asp Pro Lys lie Lys Thr Glu Thr Ser Glu Glu Gly 

780 785 790 

35 TCT GGA GAC TTG GAT AAT CTA GAT GCT ATT CTT GGT GAT CTG ACT AGT 2 512 

Ser Gly Asp Leu Asp Asn Leu Asp Ala lie Leu Gly Asp Leu Thr Ser 
795 800 805 

TCT GAC TTT TAC AAT AAT TCC ATA TCC TCA AAT GGT AGT CAT CTG GGG 2 56 0 

40 Ser Asp Phe Tyr Asn Asn Ser lie Ser Ser Asn Gly Ser His Leu Gly 
810 815 820 825 

ACT AAG CAA CAG GTG TTT CAA GGA ACT AAT TCT CTG GGT TTG AAA AGT 2 608 

Thr Lys Gin Gin Val Phe Gin Gly Thr Asn Ser Leu Gly Leu Lys Ser 
45 830 835 840 

TCA CAG TCT GTG CAG TCT ATT CGT CCT CCA TAT AAC CGA GCA GTG TCT 2656 
Ser Gin Ser Val Gin Ser lie Arg Pro Pro Tyr Asn Arg Ala Val Ser 
845 850 855 

50 

CTG GAT AGC CCT GTT TCT GTT GGC TCA AGT CCT CCA GTA AAA AAT ATC 2 704 

Leu Asp Ser Pro Val Ser Val Gly Ser Ser Pro Pro Val Lys Asn lie 
860 865 870 

55 AGT GCT TTC CCC ATG TTA CCA AAG CAA CCC ATG TTG GGT GGG AAT CCA 2 752 

Ser Ala Phe Pro Met Leu Pro Lys Gin Pro Met Leu Gly Gly Asn Pro 
875 880 885 



AGA ATG ATG GAT AGT CAG GAA AAT TAT GGC TCA AGT ATG GGT GGG CCA 2 800 

Arg Met Met Asp Ser Gin Glu Asn Tyr Gly Ser Ser Met Gly Gly Pro 
890 895 900 905 

5 

AAC CGA AAT GTG ACT GTG ACT CAG ACT CCT TCC TCA GGA GAC TGG GGC 2 84 8 

Asn Arg Asn Val Thr Val Thr Gin Thr Pro Ser Ser Gly Asp Trp Gly 
910 915 920 

10 TTA CCA AAC TCA AAG GCC GGC AGA ATG GAA CCT ATG AAT TCA AAC TCC 2 8 96 

Leu Pro Asn Ser Lys Ala Gly Arg Met Glu Pro Met Asn Ser Asn Ser 
925 930 935 

ATG GGA AGA CCA GGA GGA GAT TAT AAT ACT TCT TTA CCC AGA CCT GCA 2 944 

15 Met Gly Arg Pro Gly Gly Asp Tyr Asn Thr Ser Leu Pro Arg Pro Ala 
940 945 950 

CTG GGT GGC TCT ATT CCC ACA TTG CCT CTT CGG TCT AAT AGC ATA CCA 2 992 

Leu Gly Gly Ser lie Pro Thr Leu Pro Leu Arg Ser Asn Ser lie Pro 
20 955 960 965 

GGT GCG AGA CCA GTA TTG CAA CAG CAG CAG CAG ATG CTT CAA ATG AGG 3 04 0 

Gly Ala Arg Pro Val Leu Gin Gin Gin Gin Gin Met Leu Gin Met Arg 
970 975 980 985 

25 

CCT GGT GAA ATC CCC ATG GGA ATG GGG GCT AAT CCC TAT GGC CAA GCA 3 08 8 

Pro Gly Glu lie Pro Met Gly Met Gly Ala Asn Pro Tyr Gly Gin Ala 
990 995 1000 

30 GCA GCA TCT AAC CAA CTG GGT TCC TGG CCC GAT GGC ATG TTG TCC ATG 313 6 

Ala Ala Ser Asn Gin Leu Gly Ser Trp Pro Asp Gly Met Leu Ser Met 
1005 1010 1015 

GAA CAA GTT TCT CAT GGC ACT CAA AAT AGG CCT CTT CTT AGG AAT TCC 3184 
35 Glu Gin Val Ser His Gly Thr Gin Asn Arg Pro Leu Leu Arg Asn Ser 
1020 1025 1030 

CTG GAT GAT CTT GTT GGG CCA CCT TCC AAC CTG GAA GGC CAG AGT GAC 32 32 

Leu Asp Asp Leu Val Gly Pro Pro Ser Asn Leu Glu Gly Gin Ser Asp 
40 1035 1040 1045 

GAA AGA GCA TTA TTG GAC CAG CTG CAC ACT CTT CTC AGC AAC ACA GAT 32 8 0 

Glu Arg Ala Leu Leu Asp Gin Leu His Thr Leu Leu Ser Asn Thr Asp 
1050 1055 1060 1065 

45 

GCG ACA GGC CTG GAA GAA ATT GAC AGA GCT TTG GGC ATT CCT GAA CTT 33 2 8 

Ala Thr Gly Leu Glu Glu lie Asp Arg Ala Leu Gly lie Pro Glu Leu 
1070 1075 1080 

50 GTC AAT CAG GGA CAG GCA TTA GAG CCC AAA CAG GAT GCT TTC CAA GGC 33 76 

Val Asn Gin Gly Gin Ala Leu Glu Pro Lys Gin Asp Ala Phe Gin Gly 
1085 1090 1095 

CAA GAA GCA GCA GTA ATG ATG GAT CAG AAG GCA GGA TTA TAT GGA CAG 3424 
55 Gin Glu Ala Ala Val Met Met Asp Gin Lys Ala Gly Leu Tyr Gly Gin 
1100 1105 1110 
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ACA TAC CCA GCA CAG GGG CCT CCA ATG CAA GGA GGC TTT CAT CTT CAG 34 72 

Thr Tyr Pro Ala Gin Gly Pro Pro Met Gin Gly Gly Phe His Leu Gin 
1115 1120 1125 

5 GGA CAA TCA CCA TCT TTT AAC TCT ATG ATG AAT CAG ATG AAC CAG CAA 352 0 

Gly Gin Ser Pro Ser Phe Asn Ser Met Met Asn Gin Met Asn Gin Gin 
1130 1135 1140 1145 

GGC AAT TTT CCT CTC CAA GGA ATG CAC CCA CGA GCC AAC ATC ATG AGA 3 568 

10 Gly Asn Phe Pro Leu Gin Gly Met His Pro Arg Ala Asn lie Met Arg 

1150 1155 1160 

CCC CGG ACA AAC ACC CCC AAG CAA CTT AGA ATG CAG CTT CAG CAG AGG 3616 
Pro Arg Thr Asn Thr Pro Lys Gin Leu Arg Met Gin Leu Gin Gin Arg 
15 1165 1170 1175 

CTG CAG GGC CAG CAG TTT TTG AAT CAG AGC CGA CAG GCA CTT GAA TTG 3664 
Leu Gin Gly Gin Gin Phe Leu Asn Gin Ser Arg Gin Ala Leu Glu Leu 
1180 1185 1190 

20 

AAA ATG GAA AAC CCT ACT GCT GGT GGT GCT GCG GTG ATG AGG CCT ATG 3 712 

Lys Met Glu Asn Pro Thr Ala Gly Gly Ala Ala Val Met Arg Pro Met 
1195 1200 1205 

25 ATG CAG CCC CAG CAG GGT TTT CTT AAT GCT CAA ATG GTC GCC CAA CGC 3 76 0 

Met Gin Pro Gin Gin Gly Phe Leu Asn Ala Gin Met Val Ala Gin Arg 
1210 1215 1220 1225 

AGC AGA GAG CTG CTA AGT CAT CAC TTC CGA CAA CAG AGG GTG GCT ATG 3 8 08 

30 Ser Arg Glu Leu Leu Ser His His Phe Arg Gin Gin Arg Val Ala Met 

1230 1235 1240 

ATG ATG CAG CAG CAG CAA CAG CAG CAG CAG CAG CAG CAG CAG CAG CAA 3 8 56 

Met Met Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 
35 1245 1250 1255 

CAG CAA CAG CAA CAG CAG CAA CAG CAG CAA ACC CAG GCC TTC AGC CCA 3 904 

Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Thr Gin Ala Phe Ser Pro 
1260 1265 1270 

40 

CCT CCT AAT GTG ACT GCT TCC CCC AGC ATG GAT GGG CTT TTG GCA GGA 3 952 

Pro Pro Asn Val Thr Ala Ser Pro Ser Met Asp Gly Leu Leu Ala Gly 
1275 1280 1285 

45 CCC ACA ATG CCA CAA GCT CCT CCG CAA CAG TTT CCA TAT CAA CCA AAT 4 000 

Pro Thr Met Pro Gin Ala Pro Pro Gin Gin Phe Pro Tyr Gin Pro Asn 
1290 1295 1300 1305 

TAT GGA ATG GGA CAA CAA CCA GAT CCA GCC TTT GGT CGA GTG TCT AGT 4 04 8 

50 Tyr Gly Met Gly Gin Gin Pro Asp Pro Ala Phe Gly Arg Val Ser Ser 

1310 1315 1320 

CCT CCC AAT GCA ATG ATG TCG TCA AGA ATG GGT CCC TCC CAG AAT CCC 4 096 

Pro Pro Asn Ala Met Met Ser Ser Arg Met Gly Pro Ser Gin Asn Pro 
55 1325 1330 1335 

ATG ATG CAA CAC CCG CAG GCT GCA TCC ATC TAT CAG TCC TCA GAA ATG 4144 
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Met Met Gin His Pro Gin Ala Ala Ser lie Tyr Gin Ser Ser Glu Met 
1340 1345 1350 

AAG GGC TGG CCA TCA GGA AAT TTG GCC AGG AAC AGC TCC TTT TCC CAG 4192 
5 Lys Gly Trp Pro Ser Gly Asn Leu Ala Arg Asn Ser Ser Phe Ser Gin 
1355 1360 1365 

CAG CAG TTT GCC CAC CAG GGG AAT CCT GCA GTG TAT AGT ATG GTG CAC 4 24 0 

Gin Gin Phe Ala His Gin Gly Asn Pro Ala Val Tyr Ser Met Val His 
10 1370 1375 1380 1385 

ATG AAT GGC AGC AGT GGT CAC ATG GGA CAG ATG AAC ATG AAC CCC ATG 42 8 8 

Met Asn Gly Ser Ser Gly His Met Gly Gin Met Asn Met Asn Pro Met 
1390 1395 1400 

15 

CCC ATG TCT GGC ATG CCT ATG GGT CCT GAT CAG AAA TAC TGC TGA CAT CT 4 33 8 
Pro Met Ser Gly Met Pro Met Gly Pro Asp Gin Lys Tyr Cys * 
1405 1410 1415 

20 CTGCACCAGG ACCTCTTAAG GAAACCACTG TACAAATGAC ACTGCACTAG GATTATTGGG 4 3 98 

f- AAGGAATCAT TGTTCCAGGC ATCCATCTTG GAAGAAAGGA CCAGCTTTGA GCTCCATCAA 44 5 8 

J GGGTATTTTA AGTGATGTCA TTTGAGCAGG AATTCTAG 44 96 
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(2) INFORMATION FOR SEQ ID NO : 2 : 



Jl (i) SEQUENCE CHARACTERISTICS: 

^' 30 (A) LENGTH: 1417 amino acids 

Q (B) TYPE: amino acid 

:7; (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ser Gly Leu Gly Glu Asn Leu Asp Pro Leu Ala Ser Asp Ser Arg 
15 10 15 

Lys Arg Lys Leu Pro Cys Asp Thr Pro Gly Gin Gly Leu Thr Cys Ser 
20 25 30 



Gly Glu Lys Arg Arg Arg Glu Gin Glu Ser Lys Tyr lie Glu Glu Leu 
45 35 40 45 

Ala Glu Leu lie Ser Ala Asn Leu Ser Asp lie Asp Asn Phe Asn Val 
50 55 60 

50 Lys Pro Asp Lys Cys Ala lie Leu Lys Glu Thr Val Arg Gin lie Arg 
65 70 75 80 

Gin lie Lys Glu Gin Gly Lys Thr lie Ser Asn Asp Asp Asp Val Gin 
85 90 95 

55 

Lys Ala Asp Val Ser Ser Thr Gly Gin Gly Val lie Asp Lys Asp Ser 
100 105 110 



Leu Gly Pro Leu Leu Leu Gin Ala Leu Asp Gly Phe Leu Phe Val Val 
115 120 125 

5 Asn Arg Glu Ala Asn lie Val Phe Val Ser Glu Asn Val Thr Gin Tyr 
130 135 140 

Leu Gin Tyr Lys Gin Glu Asp Leu Val Asn Thr Ser Val Tyr Asn lie 
145 150 155 160 

10 

Leu His Glu Glu Asp Arg Lys Asp Phe Leu Lys Asn Leu Pro Lys Ser 
165 170 175 

Thr Val Asn Gly Val Ser Trp Thr Asn Glu Thr Gin Arg Gin Lys Ser 
15 180 185 190 

His Thr Phe Asn Cys Arg Met Leu Met Lys Thr Pro His Asp lie Leu 
195 200 205 

20 Glu Asp lie Asn Ala Ser Pro Glu Met Arg Gin Arg Tyr Glu Thr Met 
_ 210 215 220 

Gin Cys Phe Ala Leu Ser Gin Pro Arg Ala Met Met Glu Glu Gly Glu 
Q 225 230 235 240 

=p25 

M Asp Leu Gin Ser Cys Met lie Cys Val Ala Arg Arg lie Thr Thr Gly 

245 250 255 

^ Glu Arg Thr Phe Pro Ser Asn Pro Glu Ser Phe lie Thr Arg His Asp 

30 260 265 270 

JTJ Leu Ser Gly Lys Val Val Asn lie Asp Thr Asn Ser Leu Arg Ser Ser 

275 280 285 

^^35 Met Arg Pro Gly Phe Glu Asp lie lie Arg Arg Cys lie Gin Arg Phe 
S 290 295 300 

Phe Ser Leu Asn Asp Gly Gin Ser Trp Ser Gin Lys Arg His Tyr Gin 
305 310 315 320 

40 

Glu Ala Tyr Leu Asn Gly His Ala Glu Thr Pro Val Tyr Arg Phe Ser 
325 330 335 

Leu Ala Asp Gly Thr lie Val Thr Ala Gin Thr Lys Ser Lys Leu Phe 
45 340 345 350 

Arg Asn Pro Val Thr Asn Asp Arg His Gly Phe Val Ser Thr His Phe 
355 360 365 

50 Leu Gin Arg Glu Gin Asn Gly Tyr Arg Pro Asn Pro Asn Pro Val Gly 
370 375 380 

Gin Gly lie Arg Pro Pro Met Ala Gly Cys Asn Ser Ser Val Gly Gly 
385 390 395 400 

55 

Met Ser Met Ser Pro Asn Gin Gly Leu Gin Met Pro Ser Ser Arg Ala 
405 410 415 
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Tyr Gly Leu Ala Asp Pro Ser Thr Thr Gly Gin Met Ser Gly Ala Arg 
420 425 430 

Tyr Gly Gly Ser Ser Asn lie Ala Ser Leu Thr Pro Gly Pro Gly Met 
435 440 445 

Gin Ser Pro Ser Ser Tyr Gin Asn Asn Asn Tyr Gly Leu Asn Met Ser 
450 455 460 

Ser Pro Pro His Gly Ser Pro Gly Leu Ala Pro Asn Gin Gin Asn lie 
465 470 475 480 

Met lie Ser Pro Arg Asn Arg Gly Ser Pro Lys lie Ala Ser His Gin 
485 490 495 

Phe Ser Pro Val Ala Gly Val His Ser Pro Met Ala Ser Ser Gly Asn 
500 505 510 

Thr Gly Asn His Ser Phe Ser Ser Ser Ser Leu Ser Ala Leu Gin Ala 
515 520 525 

lie Ser Glu Gly Val Gly Thr Ser Leu Leu Ser Thr Leu Ser Ser Pro 
530 535 540 

Gly Pro Lys Leu Asp Asn Ser Pro Asn Met Asn lie Thr Gin Pro Ser 
545 550 555 560 

Lys Val Ser Asn Gin Asp Ser Lys Ser Pro Leu Gly Phe Tyr Cys Asp 
565 570 575 

Gin Asn Pro Val Glu Ser Ser Met Cys Gin Ser Asn Ser Arg Asp His 
580 585 590 

Leu Ser Asp Lys Glu Ser Lys Glu Ser Ser Val Glu Gly Ala Glu Asn 
595 600 605 

Gin Arg Gly Pro Leu Glu Ser Lys Gly His Lys Lys Leu Leu Gin Leu 
610 615 620 

Leu Thr Cys Ser Ser Asp Asp Arg Gly His Ser Ser Leu Thr Asn Ser 
625 630 635 640 

Pro Leu Asp Ser Ser Cys Lys Glu Ser Ser Val Ser Val Thr Ser Pro 
645 650 655 

Ser Gly Val Ser Ser Ser Thr Ser Gly Gly Val Ser Ser Thr Ser Asn 
660 665 670 

Met His Gly Ser Leu Leu Gin Glu Lys His Arg lie Leu His Lys Leu 
675 680 685 

Leu Gin Asn Gly Asn Ser Pro Ala Glu Val Ala Lys lie Thr Ala Gin 
690 695 700 



Ala Thr Gly Lys Asp Thr Ser Ser lie Thr Ser Cys Gly Asp Gly Asn 
705 710 715 720 
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Val Val Lys Gin Glu Gin Leu Ser Pro Lys Lys Lys Glu Asn Asn Ala 
725 730 735 

5 Leu Leu Arg Tyr Leu Leu Asp Arg Asp Asp Pro Ser Asp Ala Leu Ser 
740 745 750 

Lys Glu Leu Gin Pro Gin Val Glu Gly Val Asp Asn Lys Met Ser Gin 
755 760 765 

10 

Cys Thr Ser Ser Thr lie Pro Ser Ser Ser Gin Glu Lys Asp Pro Lys 
770 775 780 

lie Lys Thr Glu Thr Ser Glu Glu Gly Ser Gly Asp Leu Asp Asn Leu 
15 785 790 795 800 

Asp Ala lie Leu Gly Asp Leu Thr Ser Ser Asp Phe Tyr Asn Asn Ser 
805 810 815 

20 lie Ser Ser Asn Gly Ser His Leu Gly Thr Lys Gin Gin Val Phe Gin 
820 825 830 

Gly Thr Asn Ser Leu Gly Leu Lys Ser Ser Gin Ser Val Gin Ser lie 
835 840 845 

25 

Arg Pro Pro Tyr Asn Arg Ala Val Ser Leu Asp Ser Pro Val Ser Val 
850 855 860 

Gly Ser Ser Pro Pro Val Lys Asn lie Ser Ala Phe Pro Met Leu Pro 
30 865 870 875 880 

Lys Gin Pro Met Leu Gly Gly Asn Pro Arg Met Met Asp Ser Gin Glu 
885 890 895 

35 Asn Tyr Gly Ser Ser Met Gly Gly Pro Asn Arg Asn Val Thr Val Thr 
900 905 910 

Gin Thr Pro Ser Ser Gly Asp Trp Gly Leu Pro Asn Ser Lys Ala Gly 
915 920 925 

40 

Arg Met Glu Pro Met Asn Ser Asn Ser Met Gly Arg Pro Gly Gly Asp 
930 935 940 

Tyr Asn Thr Ser Leu Pro Arg Pro Ala Leu Gly Gly Ser lie Pro Thr 
45 945 950 955 960 

Leu Pro Leu Arg Ser Asn Ser lie Pro Gly Ala Arg Pro Val Leu Gin 
965 970 975 

50 Gin Gin Gin Gin Met Leu Gin Met Arg Pro Gly Glu lie Pro Met Gly 
980 985 990 

Met Gly Ala Asn Pro Tyr Gly Gin Ala Ala Ala Ser Asn Gin Leu Gly 
995 1000 1005 

55 

Ser Trp Pro Asp Gly Met Leu Ser Met Glu Gin Val Ser His Gly Thr 
1010 1015 1020 
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Gln Asn Arg Pro Leu Leu Arg Asn Ser Leu Asp Asp Leu Val Gly Pro 
1025 1030 1035 1040 

Pro Ser Asn Leu Glu Gly Gin Ser Asp Glu Arg Ala Leu Leu Asp Gin 
1045 1050 1055 

Leu His Thr Leu Leu Ser Asn Thr Asp Ala Thr Gly Leu Glu Glu lie 
1060 1065 1070 

Asp Arg Ala Leu Gly lie Pro Glu Leu Val Asn Gin Gly Gin Ala Leu 
1075 1080 1085 

Glu Pro Lys Gin Asp Ala Phe Gin Gly Gin Glu Ala Ala Val Met Met 
1090 1095 1100 

Asp Gin Lys Ala Gly Leu Tyr Gly Gin Thr Tyr Pro Ala Gin Gly Pro 
1105 1110 1115 1120 

Pro Met Gin Gly Gly Phe His Leu Gin Gly Gin Ser Pro Ser Phe Asn 
1125 1130 1135 

Ser Met Met Asn Gin Met Asn Gin Gin Gly Asn Phe Pro Leu Gin Gly. 
1140 1145 1150 

Met His Pro Arg Ala Asn lie Met Arg Pro Arg Thr Asn Thr Pro Lys 
1155 1160 1165 

Gin Leu Arg Met Gin Leu Gin Gin Arg Leu Gin Gly Gin Gin Phe Leu 
1170 1175 1180 

Asn Gin Ser Arg Gin Ala Leu Glu Leu Lys Met Glu Asn Pro Thr Ala 
1185 1190 1195 1200 

Gly Gly Ala Ala Val Met Arg Pro Met Met Gin Pro Gin Gin Gly Phe 
1205 1210 1215 

Leu Asn Ala Gin Met Val Ala Gin Arg Ser Arg Glu Leu Leu Ser His 
1220 1225 1230 

His Phe Arg Gin Gin Arg Val Ala Met Met Met Gin Gin Gin Gin Gin 
1235 1240 1245 

Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 
1250 1255 1260 

Gin Gin Gin Thr Gin Ala Phe Ser Pro Pro Pro Asn Val Thr Ala Ser 
1265 1270 1275 1280 

Pro Ser Met Asp Gly Leu Leu Ala Gly Pro Thr Met Pro Gin Ala Pro 
1285 1290 1295 

Pro Gin Gin Phe Pro Tyr Gin Pro Asn Tyr Gly Met Gly Gin Gin Pro 
1300 1305 1310 



Asp Pro Ala Phe Gly Arg Val Ser Ser Pro Pro Asn Ala Met Met Ser 
1315 1320 1325 
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Ser Arg Met Gly Pro Ser Gin Asn Pro Met Met Gin His Pro Gin Ala 
1330 1335 1340 

5 Ala Ser lie Tyr Gin Ser Ser Glu Met Lys Gly Trp Pro Ser Gly Asn 
1345 1350 1355 1360 

Leu Ala Arg Asn Ser Ser Phe Ser Gin Gin Gin Phe Ala His Gin Gly 
1365 1370 1375 

10 

Asn Pro Ala Val Tyr Ser Met Val His Met Asn Gly Ser Ser Gly His 
1380 1385 1390 

Met Gly Gin Met Asn Met Asn Pro Met Pro Met Ser Gly Met Pro Met 
15 1395 1400 1405 



Gly Pro Asp Gin Lys Tyr Cys * 
1410 1415 



